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Abstract —A non-parametric technique for modeling the be¬ 
havior of power amplifiers is presented. The proposed technique 
relies on the principles of density estimation using the kernel 
method and is suited for use in power amplifier modeling. 
The proposed methodology transforms the input domain into 
an orthogonal memory domain. In this domain, non-parametric 
static functions are discovered using the kernel estimator. These 
orthogonal, non-parametric functions can he fitted with any 
desired mathematical structure, thus facilitating its implementa¬ 
tion. Furthermore, due to the orthogonality, the non-parametric 
functions can he analyzed and discarded individually, which 
simplifies pruning basis functions and provides a tradeoff be¬ 
tween complexity and performance. The results show that the 
methodology can be employed to model power amplifiers, therein 
yielding error performance similar to state-of-the-art parametric 
models. Furthermore, a parameter-efficient model structure with 
6 coefficients was derived for a Doherty power amplifier, therein 
significantly reducing the deployment’s computational complex¬ 
ity. Finally, the methodology can also be well exploited in digital 
linearization techniques. 

Index Terms —Power amplifier, non-parametric model, ker¬ 
nel, basis functions, power amplifier linearization, Digital pre¬ 
distortion. 

I. Introduction 

NERGY-EFFICIENT power amplifiers (PAs) in wireless 
networks usually behave in a nonlinear fashion, thereby 
producing significant nonlinear distortions that degrade net¬ 
work performance. This creates the need for suitable behav¬ 
ioral models for PAs that provide simpler descriptions of 
nonlinear distortion mechanisms and tools for the mitigation 
of these effects such as digital pre-distortion (DPD) techniques 

m. 

Historically, behavioral models for PAs have been derived 
using the Volterra series El. The disadvantage of the Volterra 
series is that it involves a large number of parameters, which 
hinders its practical implementation. Pruning Volterra series 

Copyright ©2015 IEEE. Personal use of this material is permitted. How¬ 
ever, permission to use this material for any other purposes must be obtained 
from the IEEE by sending a request to pubs-permissions@ieee.org. 

E. Zenteno and Z.A Khan are with the Department of Electronics, Math¬ 
ematics, and Natural Sciences, the University of Gavle, SE 80176 Gavle, 
Sweden, and also with the Department of Signal Processing, Royal Institute 
of Technology KTH, SE 10044 Stockholm, Sweden (e-mail: efnzeo@hig.se, 
zanahn@hig.se). 

M. Isaksson is with the Department of Electronics, Mathematics, and 
Natural Sciences, the University of Gavle, SE 80176 Gavle, Sweden (e-mail: 
min@hig.se). 

P. Handel is with the Department of Signal Processing, Royal Institute of 
Technology KTH, SE 10044 Stockholm, Sweden, (e-mail: ph@kth.se) 


has been actively studied to provide low-complexity and 
high-performance behavioral models to mitigate PA nonlinear 
distortions 0 , a, 0 . Although pruning Volterra series has 
produced useful empirical models, these pruned models are 
general structures for smaller classes of nonlinear systems. 
This requires engineers to test different pruned model struc¬ 
tures and further select the nonlinearity order and memory 
depths to meet certain performance requirements with a level 
of complexity that depends on application constraints. Hence, 
for a specific PA, trimming the pruned Volterra models may 
produce even lower complexity with the desired error perfor¬ 
mance 0,0. This raises the question as to whether there may 
exist techniques to obtain structural knowledge of a specific PA 
that in turn can be used to construct simpler model structures 
with the required model error performance. This paper presents 
a technique of this class. 

Trimmed model structures of reduced complexity can also 
be obtained using sparse estimation techniques 0, 0. How¬ 
ever, sparse estimation techniques are usually computationally 
demanding and require the choosing of an initial model to 
be reduced. On the one hand, a general model is desired as 
an initial set that preserves the modeling properties. However, 
this involves a large set, which increases the complexity of 
the technique. On the other hand, starting from a small class 
of model structures and reducing complexity produces results 
that are dependent on this initial choice. 

This paper presents a non-parametric method of discovering 
PA structural information. Thus, it assumes no a priori model 
structure for the PA. The proposed method considers static and 
dynamic distortion effects and provides a tool for analyzing 
the PA transfer function. In particular, the tool can be used to 
tailor parametric models of simpler forms. Thus, the method 
effectively reduces the computational complexity of the model. 
In PA modeling, other non-parametric techniques use statisti¬ 
cal functions such as cumulative distribution functions (CDFs) 
0, 0 and higher order statistics ifTOl . However, 0, 0, and 
Eol consider solely memoryless distortion effects, and hence, 
they are ineffective at characterizing and compensating PA 
distortion caused by memory effects. 

The proposed technique is based on non-parametric density 
estimation im referred to as the kernel smoothing estimator 
method or simply the kernel method ifT^ . Compared to 
polynomial-based PA models, the kernel method can estimate 
nonlinear functions of high nonlinearity order without nu¬ 
merical difficulties JH. Furthermore, the kernel method uses 
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window averages, which are less computationally demanding 
than the matrix (pseudo) inversions required in the least square 
methods. Finally, the kernel estimator has strong statistical 
properties: asymptotic convergence m, optimal estimation 
in the square error sense given a limited number of samples 
and robustness against noise sources lEl. All these properties 
make the kernel method a suitable candidate for modeling PAs. 

The work reported in this paper reviews the modeling 
methodology presented in m and performs the adapta¬ 
tions necessary for the PA measurement scenario. PAs are 
characterized by band-limited, complex baseband equivalent 
signals, which make the method in m unsuitable for PA 
modeling. However, with the adaptations proposed in am 
and our previous study ifTSIl . we obtain a methodology and 
method suitable for this application. In contrast to traditional 
PA modeling techniques, the work reported here transforms 
the input sample domain into an orthogonal domain, where 
the model structure is obtained using the kernel method. The 
orthogonal domain simplifies the analysis of the PA transfer 
function; allowing the addition or removal of basis functions 
provides a tradeoff between complexity and performance. This 
result can be transferred to the original sample domain, thereby 
reversing the orthogonalization process (linear combination) 
and obtaining model structures that are comparable in per¬ 
formance with the state-of-the-art methods but with reduced 
computational requirements for deployment. 



Fig. 1. Illustration of the estimation of the function g{-) at fixed grid Xi 
through a triangular kernel ip(-). The value g{xi) is obtained as the weighted 
average of the output data z through the kernel i^(-). 

ELi rr) = ELi d!(M+ild)! - with p being the highest 
dimension of the functions in ([T]) and ! denoting the factorial 
operator. The high dimensionality increases the computational 
complexity of the identification and deployment of the models. 
Thus, we analyze the relationship in ([T]l and study possible 
simplifications to allow it to be suitable for PA modeling. 


II. PA Modeling 

A. PA model 

Let u{n) and y{n) denote the n-th complex-valued sample 
of the baseband signals corresponding to the input and output 
of a PA, respectively. The PA nonlinear transfer function is 
approximated by ifThll 

y(^) = X! “ mi))+ 

mi 

+ ^ ^ fmi,m 2 {u{n - mi),u{n - m 2 )) + ... 

mi m 2 

T 'y ' ... 'y ' fmi ,m 2 ,...,mp (u(n mi), ...,u(^Tl Trip)), 
mi irip 

( 1 ) 

where fmi{-), /mi.m 2 (-, •) and /mi.m 2 ....,mp(', ■■■,•) are non¬ 
linear static functions whose domain dimensions are 1,2 
and p, respectively. The summations go up to M subject to 
0 > mi > m 2 > ... > rrip > M, where M is the maximum 
memory depth considered. 

The Volterra series is a special form of ([T]), which can be 
obtained when /mi(')> /mi,m 2 (-> O and /mi.m 2 ,....mp(-, ■•■,•) 
are defined as the scaled product of their arguments. The 
static functions in ([T]| can represent high nonlinearity orders 
of the Volterra series. In particular, high nonlinearity orders 
are coupled to different memory depths, which is the cause 
of the rapid growth in the number of parameters in the 
Volterra series. Despite the different features of ([T]i compared 
to the Volterra series, both suffer from high dimensionality. 
Considering 0 > mi > m 2 > ... > rrip > M, the 
system in O has a total number of additive functions of 


B. Kernel method brief 

The kernel estimator as applied to the estimation of a 
static nonlinear input output relation ifT^ is briefly reviewed. 
Consider the set of real-valued input data {a:(n)})))rQ^ passed 
through an unknown, static nonlinear function p(-) and pro¬ 
ducing the output {z{n)}^~Q, that is, z(n) = g{x{n)) for 
n = 0,..., A^—1. Then, the static function g{-) can be estimated 
at a scalar point Xi as illustrated in Fig. [T] by the kernel 
(window) average m 


9{xf) = 


N-1 

E 


iff 


c(n)—a: 


V— vA^—1 ( 

=0 ^^=0 


c(^)-a 


zin), 


( 2 ) 


where the grid of points Xi for i = 1, ...,T span the amplitude 
support of x{n) and ip{-) is the kernel with aperture 5. Here, 
the triangular kernel is preferred because it is the minimum 
mean square error estimator for a limited number of samples 
and is robust against noise CSl, that is. 


ip{x) 


1 — |a;| if |a;| < 1 
0 if |a;| > 1, 


(3) 


with I • I denoting the absolute value. Equation (|2| is evaluated 
only if the denominator is different from zero. In this paper, 
a linear interpolation between the two nearest neighbors is 
employed to compute g{-) for an arbitrary input within the 
amplitude support of x(n). 

The kernel method is not directly suited to simulating PA 
behavior. First, PA measurements show a significant corre¬ 
lation between different samples of the input signal u{n). 
The sample correlation makes the output of the functions 
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fmi,mA-r) and ■ • ■, jointly corre¬ 

lated. Thus, their estimation needs to simultaneously account 
for all of them. 

Second, the kernel method is intended for real-valued data. 
For complex-valued data widely available within PA instru¬ 
mentation, the static functions in ([TJ are functions of complex¬ 
valued inputs and outputs ITSl . This results in the method 
having large computational and storage requirements. In the 
following, a method that addresses these drawbacks is outlined 
and discussed with simplifications (complexity reductions) of 
([T]i suitable for PA modeling. 


C. Removing correlation by orthogonalization 

The PA input signal is band limited and digitized with over- 
sampling, yielding u{n). As a result of the oversampling, this 
discrete signal has a significant correlation between different 
samples. However, removing the correlation in u{n) can be 
viewed as orthogonalizing it im, which can be efficiently 
performed using the Gram-Schmidt (GS) process. 

According to (ID, the signal set to be orthogonalized lies in 
the space 

U = {u{n ),..., u{n — M)} . (4) 

The GS process yields an orthogonal set 14 = 

{u{n),... ,u{n — M)}, where u{n) = u{n), followed 

by an iterative process for fc = 1,..., M, 

k-l 

u{n — k) = u{n — k) — Pk^iu{n — £). (5) 

e=o 

The scalar Pk^e is a projection of the signal u{n — £) over 
u{n — k) defined by 

Pk,i = '^u*{n-k)u{n-£), (6) 

n 

where * denotes the complex conjugate operator. Note that 
the GS process involves a linear combination, and thus, it 
can be reversed without any loss of information. Assuming 
that u{n) is a wide-sense stationary stochastic process with 
k-th auto-correlation lag denoted by ru{k), we note that the 
projections can be a priori calculated using ru{k). This leads 
to a computationally preferable method; e.g., a rectangular¬ 
shaped power spectral density of a Long-Term Evolution 
(LTE) signal yields a sine-shaped auto-correlation function. 


D. Real-valued PA input signal 

The distortion produced by PAs operating within wireless 
networks can be regarded as amplitude dependent ID, ifTSll . 
Thus, considering solely the amplitude of the signals in the 
orthogonal input set U yields the set 

\14\ = {\u{n)\,...,\u{n-M)\}, (7) 

which will be the input to our kernel estimator, e.g., x{n) = 
|it(n)|. To compensate for the phase contribution, the output 
signal y{n) is transformed as 


( 8 ) 


By applying the GS process to the input set 14 followed by 
the real-value transformation, the system O becomes 

mi 

+ '^'^gnii,m2ixin - mi),x{n - m2)) + ... 

mi m 2 

+ ^ ^ - mi ),..., x{n - mp)), 

mi mp 

(9) 

with complex-valued gmA'), 5 mi,m 2 (-,-) and 
5 mi,m 2 ....,m {',■■■■,') as the orthogonal counterparts of 
the functions in O but with real-valued arguments. This 
reduces the estimation dimension required in the kernel 
method. The system has similar features as ([T]i for 
modeling nonlinear behavior. However, in contrast to O, it 
has orthogonal basis functions. Thus, their estimation can be 
performed individually, and each basis contribution can be 
separately analyzed. 


E. Complexity reduction of ([TJ 

Despite using real-valued input signals, the complexity of 
the estimation of a multi-variable function in ® remains high; 
e.g., a p-th variable function estimated at T points for each 
variable gives a total of estimation points. Thus, the mem¬ 
ory requirements and data manipulation increase exponentially 
with p, leading to the well-known curse of dimensionality 
problem. In an attempt to alleviate this, a p-variable function 
...,•) is approximated as a sum of single-variable 

functions: 

gTni,...,mpix{n - mi), ...,x(n - mp)) « 

p p 

hmk{x{n - TUk) x{n - rud))- 

k^l 

d^k 

Thus, the single-variable functions hmA') estimated 

using (|2j. In PA modeling, (ITOJ has been motivated from a 
physical insi and signal processing perspective iflTl . Note 
that the new single-variable x{n — mu) x{n — mf) can 

d^k 

similarly be considered in the non-orthogonal domain 14 by 
augmenting it as 


W = 


p 

U, {u{n - mk) \u{n - 

d^k 


( 11 ) 


for p = 2,..., M. The orthogonalization of the data set lA' is 
performed using the GS procedure. 


F. Summary and implementation 

Consider the data set of complex-valued input and output 
measurements {u(n)} and {y{n)}, respectively. The non- 
parametric modeling approach begins by creating the input 
space 14' as indicated by (HB for the chosen maximum 
memory depth M. The implementation of the method can 
proceed by storing 14' as a matrix whose columns are the basis 
in (HU- This matrix is column-wise orthogonalized using the 


z{n) = y{n)e 
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GS process, therein yielding an orthogonal matrix. Only mag¬ 
nitude entries of the orthogonal matrix are retained according 
to O, rendering a magnitude matrix. Finally, each column 
of the magnitude matrix is used as a domain to estimate 
single-variable functions with the kernel method in (|2l). The 
results show that significant contribution to the model output 
continues to originate from a few single-variable functions. 
Thus, due to the orthogonality, the non-contributing functions 
can be eliminated from the model structure (remove the 
corresponding columns) while retaining the obtained model 
performance. 

III. Experimental 

A. Measurement setup 

The measurement setup includes a vector signal generator 
R&S SMU 200A that is used to excite the PA. The PA 
output is measured using a wideband down converter and 
a high-performance analog-to-digital converter (ADC) with 
14-bit resolution operated with a 400 MHz sampling rate. 
The amplifier being tested is the MRF8S21120HS Doherty 
amplifier with 14 dB linear gain, an operation frequency in 
the 2.1-2.2 GHz band, and a rating of 46 dBm output power 
operated at approximately 3 dB of compression. 

Two independent excitation signals with bandwidths of 12 
and 24 MHz are generated. These excitations are noise-like 
signals with peak-to-average power ratios of 11.2 and 11.4 
dB, respectively. The excitations were created in a PC using 
10® complex-valued samples uploaded to the generator and up- 
converted to 2.14 GHz to excite the PA. The measurements 
consist of 10® complex-valued samples for the input and 
the output of the amplifier with post-processing time and 
phase delay compensation IT]. The non-parametric structure is 
obtained using 10% (estimation phase) of the measured data, 
and the remaining 90% (validation phase) is used to evaluate 
the modeling error. 

B. Results 

1) User-defined parameters: In the proposed method, the 
number of grid points T and the kernel aperture 5 are user- 
defined parameters. Fig. |2] shows the normalized mean square 
error (NMSE) contours over both 5 and T in a linearly spaced 
grid. The kernel aperture 5 is shown as a percentage of the 
span of the input signal. 

The number of grid points T sets the resolution of the 
static function and the kernel aperture S sets the size of the 
input neighborhood to perform the average (estimation) (cf. 
Fig. [B- Thus, a large value of T and a small value of 6 are 
desired to produce an accurate estimation. However, for a fixed 
number N of measurement samples, decreasing 6 may degrade 
the performance because the number of measurement samples 
in each kernel function decreases, and hence, its average 
(estimate) has increased variance (less reliability). This is the 
reason for the loss in performance for small values of 6 in 
Fig.|2l Because T is the number of entries to be stored, T can 
be chosen based on the available memory resources, and as a 
rule of thumb, the kernel aperture can be set as (5 = 1/T to 
avoid the performance degrading effects, cf. Fig.|2l The choice 
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Fig. 2. NMSE (in dB) as a function of the number of grid points T and the 
kernel aperture 5 (as a percentage of the span of the input signal). 

TABLE I 

Individual contributions of the functions for two different 

SIGNAL BANDWIDTHS 


Basis 

12 MHz 

24 MHz 

NMSE 

ACEPR 

NMSE 

ACEPR 


[dB] 

[dB] 

[dB] 

[dB] 

9oi-) 

-38.3 

-54.9 

-32.5 

-50.8 

9li-) 

-5.0 

0.0 

-6.7 

-0.4 

hi-) 

-0.3 

0.0 

-2.3 

0.3 

hi-) 

-0.0 

0.0 

-0.4 

0.0 

90,l(-) 

-0.1 

-0.3 

-0.5 

-0.2 

90,2(-) 

-0.1 

0.0 

-0.1 

0.1 

90,sO) 

-0.0 

0.0 

-0.0 

0.0 

9l,2(-) 

-0.0 

0.0 

0.0 

0.0 

9l,3(-) 

0.0 

0.0 

0.0 

0.0 

92,30) 

0.0 

0.0 

0.0 

0.0 

90,1,20) 

-0.1 

0.0 

-0.1 

-0.0 

90,1,30) 

0.0 

0.0 

0.0 

0.0 

90,2,30) 

0.0 

0.0 

0.0 

0.0 

91,2,30) 

0.0 

0.0 

0.0 

0.0 

Total 

-43.9 

-55.2 

-42.6 

-51.0 


(5 = 1 /T has the advantage of efficiently using all training data 
for estimating the non-parametric model. 

2) Modeling performance: Using T = 70 and S = 1/70, 
a non-parametric model of the PA is obtained for the two 
input signals under consideration. Table U shows the individual 
contributions of the basis functions to the NMSE and to the 
adjacent channel error power ratio (ACEPR) fT] for these two 
signals. As observed in Table |I] the function go{x) contributes 
-38.3 and -32.5 dB of the total NMSE for the 12 and 24 
MHz signals, respectively. This function is the largest model 
contributor because it captures linear and nonlinear static 
effects. 

For the 12 MHz signal, the NMSE is dominated by the 
contributions of go{x) and gi{x), which provide a combined 
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Fig. 3. Amplitude and phase (in degrees) of the estimated single-variable 
functions go{-), and g 2 {-) for the 24 MHz signal (solid - blue) and 

go{') and gi(-) for the 12 MHz (dashed - red) signal. The function g 2 (') is 
not presented for the 12 MHz signal because its contribution is negligible. 


NMSE of -43.3 dB. However, in the 24 MHz input signal, the 
contribution to the NMSE from the function 52 ( 21 ) increases 
from -0.3 to -2.3 dB, thereby revealing the impact of memory 
effects caused by the increase in signal bandwidth. Note 
that the contributions from the 2- and 3-variable functions 
is negligible in the 12 MHz case, and 50,1 (•) significantly 
increases its contribution when increasing the signal bandwidth 
from 12 to 24 MHz from -0.1 to -0.5 dB. In terms of the 
ACEPR, only 50 ( 21 ) provides a significant contribution for both 
signal bandwidths. The static nonlinear distortion is modeled 
by go{-), and the linear and nonlinear dynamics are described 
by 5 i(-) and 52 (-)- Because these functions are arbitrary, the 
non-parametric structure can model memory effects coupled 
to strong nonlinearities, which are one of the causes of poor 
behavior in polynomial-based model methodologies. 

The functions contributing more than -2 dB to the NMSE 
are shown in Eig. [3 for both signal bandwidths. Despite the 
two signals being independently created and having different 
bandwidths, the estimated functions are similar to each other, 
which suggests that the method obtains structural information 
about the modeled PA. 

The power spectral density (PSD) of the input, output and 
model error evolution are plotted in Pig. H for the 24 MHz 
bandwidth signal. The model was updated sequentially to 
include the first six basis functions of Table U The in-band 
error spectrum decreases with the addition of basis functions, 
and the out-of-band error is suppressed by the use of go{x). 
These two observations are in accordance with Table |T] 

An advantage of the proposed method is that we can utilize 
the estimated basis functions (cf. Pig. |2l to build a parametric 
model of the PA. These parametric models can be of any form; 
they can be chosen to ease implementation and identification 
or to maximize performance. We seek a parameter efficient 
representation using a polynomial family as an example. Thus, 
the function 5 o(-) is modeled with a seventh-order polynomial. 
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Fig. 4. Power spectral density of the output and error model obtained by 
sequentially including the first 6 basis functions indicated in Table J] in the 
nonparametric model for the 24 MHz signal. 


and ^i(-) and g 2 {-) are modeled with linear polynomials (cf. 
Pig.H. that is, 

4 

z{n) = y^7pM(n)|u(n)|^^^“^^ + 7514(21 - 1) + 7514(14 - 2). 

p=i 

( 12 ) 

Due to the orthogonality, the remainder of the functions in 
Table U can be discarded without affecting the model perfor¬ 
mance. Purthermore, by replacing the orthogonal variables for 
the linear combination of their non-orthogonal counterparts 
given in the GS process (Section III-CI) . the parametric de¬ 
scription of this model becomes 
4 

y{n) = apu{n)\u{n)\‘^^^~^'' + a^u{n — 1) + aeu{n — 2). 

p=i 

(13) 

This 6 -parameter model [ai, ... ,ae] can be identified using 
linear regression techniques, which are commonly used in PA 
modeling ||4|. 

Pig. |5] shows the NMSE performance versus the complexity 
incurred when using the feed-forward model. The complexity 
is measured in floating point operations (PLOPs) ll20l . Pig. 
|5] compares the proposed method with several parametric 
models, such as the static nonlinear, memory polynomial, gen¬ 
eralized memory polynomial ||4|, Multi-LUT II 2 TI . Volterra ||2| 
and Kautz-Volterra models, and non-parametric models, 

such as the Histogram model |0. Different points correspond 
to different model settings (nonlinearity order and memory 
depth) being tested. 

Although, in general, it is possible to trade reduced NMSE 
for increased complexity, these settings need to be chosen with 
care to obtain optimal performance for the level of complex¬ 
ity incurred, as observed by the performance dispersion in 
Pig. |5] Moreover, some model settings with increased model 
complexity degrade the NMSE performance, as observed in 
Pig. 121 which is due to an unsuitable model being chosen. Por 
example, a static model of high nonlinearity order may have 
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Fig. 5. NMSE of the 24 MHz signal versus the number of FLOPS incurred 
when using the feed-forward model for several modeling techniques. The 
kernel method uses S = l/T, T = 70 with M memory depth. Different 
NMSE values are obtained by changing the settings in the methods. 


large complexity but remains unable to model dynamic effects, 
thereby providing limited performance. Similar arguments can 
be made for memory depths. Moreover, these detrimental 
effects have been discussed in previous studies ll20l . The 
proposed kernel method has good performance/complexity 
compared to state-of-the-art parametric models. Finally, the 
proposed method was used to construct a parameter-efficient 
structure (6-parameter model), which provides the best NMSE 
for its reduced level of complexity because it was specihcally 
tailored for the PA. 


IV. Digital Pre-distortion (DPD) 

The proposed method is tested as a pre-distorter compen¬ 
sating for nonlinear distortions at the PA output. The non- 
parametric structure is obtained using an inverse learning 
architecture, in which input and output are interchanged ll2^ . 
To increase efficiency, a clipping technique ll24l has been 
applied to the 24 MHz input signal, therein reducing its PAPR 
from 11.4 to 8.8 dB. However, care must be exercised because 
clipping techniques introduce in-band and out-of-band errors. 

From (|9]l, the function (?o(') of the DPD model has to be the 
inverse of the same function in the feed-forward model. For 
the remainder of the functions in the DPD model, they have to 
be the negative of their counterparts in the feed-forward model 
(same amplitude but with the phase shifted by 180 degrees). 
This is depicted in Fig. |6] where the feed-forward and inverse 
(DPD) estimated non-parametric functions are plotted. 

The PA operates at -25 dB of NMSE and -36 dB of 
ACPR, respectively, without DPD. The pre-distorted PA with 
the outlined method produces an NMSE and ACPR of -42 
and -49.5 dB, respectively, which shows its effectiveness in 
compensating nonlinear distortion. 


Fig. 6. Amplitude and phase (in degrees) of the estimated functions of the 
feed-forward model in solid blue and inverse (DPD) model in dashed green. 
The inverse model was estimated using the inverse learning architecture oa 


V. Conclusions 

A non-parametric method of modeling RF power amplihers 
is presented. The method does not assume an a priori model 
structure of the PA. Thus, basis functions that describe its 
behavior are estimated during the identihcation process, lead¬ 
ing to the development of tailored parametric models. These 
tailored models can be htted with any desired structure, which 
eases its implementation. In particular, parameter-efficient 
models with small errors can be obtained, thereby reducing 
the implementation and deployment computational costs. 

The presented method is based on the kernel estima¬ 
tor, which solely performs sample averages and hence does 
not suffer from numerical instabilities. Furthermore, adap¬ 
tive schemes can be made using running averages, which 
require low computational resources and feature real-time 
implementations. The proposed methodology can lead to the 
low computational resource implementation of look-up tables 
(FUTs) for adaptive digital pre-distortion (linearization). 
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